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PARAMETER  ESTIMATION  AND  IDENTIFICATION 


FOR 

SYSTEMS  WITH  DELAYS 

H.  T.  Banks,  J.  A.  Burns  and  E.  M.  Cliff 

Abstract 

Parameter  identification  problems  for  delay  systems  motivated 
by  examples  from  aerodynamics  and  biochemistry  are  considered.  The 
problem  of  estimation  of  the  delays  is  included.  Using  approximation 
results  from  semigroup  theory,  a  class  of  theoretical  approximation 
schemes  is  developed  and  two  specific  cases  (•'ave raging1*  and  ’‘’spline* 
methods)  are  shown  to  be  included  in  this  treatment.  Convergence 
results,  error  estimates,  and  a  sample  of  numerical  findings  are  given. 
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1.  Introduction 


The  estimation  of  parameters  in  dynamical  systems  is  an  impor¬ 
tant  scientific  problem  on  which  a  number  of  contributions  have  been 
made  in  the  engineering  and  mathematical  literature  (e.g.  see  [1], 
[23]).  However,  for  systems  with  delays,  very  little  on  identifi¬ 
cation  is  found  in  the  engineering  literature  and  essentially  no 
theoretical  convergence  results  are  available  for  algorithms 
dealing  with  estimation  of  the  delays  themselves.  One  obvious  dif¬ 
ficulty  (from  both  a  practical  and  theoretical  viewpoint)  with  such 
procedures  is  that  solutions  of  delay  systems  are  not  in  general 
differentiable  with  respect  to  the  delays  and  thus  many  common 
(e.g.  least  squares  gradient,  maximum  likelihood  estimator,  etc.) 
identification  techniques  are  not  directly  applicable. 

In  this  paper  we  discuss  a  class  of  methods  based  on  general 
approximation  techniques  for  systems  with  delays.  These  approxima¬ 
tion  ideas  have  been  considered  earlier  in  the  context  of  optimal 
control  problems  ([3],  [4],  [5],  [6],  [7],  [9],  [12],  [18])  where 
they  have  proved  quite  useful.  The  use  of  such  approximation  ideas 
in  connection  with  parameter  estimation  procedures  was  apparently 
first  suggested  in  [11]  and  some  preliminary  theoretical  results 
were  stated  in  [7]  and  [13].  However,  our  presentation  here  is 
the  first  (to  our  knowledge)  rigorous  treatment  of  general  theoret¬ 
ical  aspects  of  these  ideas. 

While  we  do  in  section  7  below  give  a  small  sample  of  related 
numerical  findings,  the  primary  purpose  of  this  manuscript  is  to 
present  a  theoretical  foundation  for  the  schemes  we  propose.  A 
much  more  extensive  discussion  and  a  wider  selection  of  numerical 


examples  is  presented  in  [8].  Our  sample  of  numerical  results  in 
section  7  is  included  here  mainly  to  indicate  that  the  procedures 
based  on  the  schemes  discussed  actually  are  feasible. 

The  approximation  ideas  developed  earlier  in  [5]  and  employed 
here  are  based  on  approximation  results  (the  so-called  Trotter- 
Kato  theorem)  from  linear  semigroup  theory.  In  section  2  we 
formulate  a  class  of  identification  problems  for  delay  systems  and 
show  that  they  can  be  reformulated  in  an  abstract  setting  so  as  to 
make  use  of  the  semigroup  approximation  theorem.  A  version  of  the 
Trotter-Kato  results  needed  is  given  in  section  3,  while  in  section 
4  we  show  how  to  use  this  theorem  to  insure  convergence  for  a 
class  of  identification  schemes.  We  turn  to  the  detailed  develop¬ 
ment  of  particular  schemes  based  on  "Averaging”  (see  [5])  and 
"Spline"  (see  [10])  approximations  in  the  subsequent  two  sections. 
Finally,  a  brief  indication  of  numerical  findings  for  these  two 
particular  schemes  is  given  in  section  7. 

Notation  used  throughout  the  paper  is  completely  standard. 

For  example,  Lp(a,b)  =  Lp([a,b],Rm)  denotes  the  usual  Lebesgue 
spaces  of  Rm-valued  "functions"  on  [a,b]  whose  components  are 
integrable  when  raised  to  the  pth  power.  When  m  =  1,  we  shall 
suppress  its  appearance  in  the  notation.  Lp  denotes  the 
usual  "locally"  integrable  function  spaces.  We  shall  use  the 
symbol  |*|  to  denote  the  norm  of  an  element  without  distinguishing 
between  different  norms  if  the  intended  meaning  is  clear  from  the 
context.  The  space  of  functions  with  j  continuous  derivatives 
is  denoted  by  (^(a.b).  We  shall  also  make  use  of  the  Sobolev 
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spaces  Wp^(a,b)  ®  Wp*^  Qa,b]  ,Rn)  of  Rn-valued  absolutely  continuous 

functions  possessing  j-1  absolutely  continuous  derivatives  and 

ith  derivatives  that  are  in  L  . 

P 

In  the  remaining  paragraphs  of  this  introductory  section,  we 
turn  to  a  discussion  of  examples  which  motivate  the  theoretical 
questions  that  are  the  focus  of  our  attention  in  this  paper. 
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L .  Tubular  reactor  columns  and  delay  system 
identification  and  control  problems 

Packed  bed  tubular  enzyme  reactors  are  very  important  in 
many  areas  of  industrial  and  biological  applications  (potential 
uses  involve  purification  or  classification  of  fruit  juices, 
proteolytic  treatment  of  beer,  synthesis  of  essential  amino  acids, 
enzymatic  biosynthesis  -  i.e.,  synthesis  of  antibiotics  and  steroids., 
etc.).  These  are  column  reactors  (as  depicted  in  Figure  1.1)  con¬ 
taining  enzyme  pellets  (i.e.  pellets  in  which  an  enzyme  is  insolubly 
bound),  the  enzyme  being  specific  for  a  substrate  S  which  is 
passed  through  the  column.  The  substrate  diffuses  into  the  pellets 
where  the  enzyme  catalyzes  a  reaction  resulting  in  the  product  P. 


FIGURE  1.1 


5 


We  thus  have  enzymatically  active  particles  or  pellets  in  a 
convective  flow  region.  Any  model  should  embody  important  features 
of  the  system  including  (i)  enzyme  catalyzed  reaction,  (ii)  metabolite 
(S  or  P)  diffusion  into,  out  of,  and  inside  of  the  pellets,  and 
(iii)  metabolite  convection  (and  possibly  diffusion)  in  the  flow 
region  in  the  column  exterior  to  the  pellets.  Extensive  studies 
for  both  plug-flow  (PF)  models  and  diffusion-convection-reaction 
(DCR)  models  for  the  phenomena  involved  have  been  reported  in  the 
literature  [15],  [16].  These  models  can  be  formulated  from  first 
principles  using  transport  equations  of  the  form 


9s 

at 


+  c 


as 

3x 
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where  D^,D2  are  diffusion  coefficients,  c  is  the  convective 
flow  velocity,  x  is  the  column  axial  direction,  y  the  perpendicular 
direction  (in  a  2-dimensional  model),  and  V  is  a  nonlinear  reaction 
velocity  approximation  (e.g.,  V  =  -P  jfj)  •  The  column  in  this  case 
is  approximated  by  a  two-compartment  (pellet  phase  and  liquid  phase) 
model  as  shown  in  Figure  1.2. 

PF  models  incorporate  assumptions  that  one  may  ignore  diffusion 
in  both  the  pellet  and  solution  (flow)  regions.  Careful  investigation 
of  these  models  reveal  that  they  are  of  limited  use  in  actual 
applications  since  it  is  found  that  certain  kinetic  constants  must 
actually  be  allowed  to  vary  (in  an  unpredictable  manner)  with  the 
flow  velocity  in  order  to  fit  the  models  to  experimental  data.  On 
the  other  hand,  the  DCR  models  were  found  to  perform  quite  adequately 
when  compared  with  the  data.  The  main  difficulty  in  employing  the 
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PELLET  PHASE 


LIQUID  PHASE 


FIGURE  1.2 


DGR  models  involves  the  rather  lengthy  calculations  that  must 

be  made  in  carrying  out  identification  and  control  procedures  with 
these  models.  It  is,  therefore,  desirable  to  have  a  model  which  in 
complexity  and  accuracy  (hopefully  similar  to  the  PF  models  with 
respect  to  the  latter)  is  somewhere  between  the  PF  and  DCR  models 
and  for  which  efficient  numerical  procedures  are  available. 

A  candidate  for  such  a  model  has  been  proposed  by  J.P.  Kernevez 
and  his  colleagues  at  University  de  Technologie  de  Compibgne.  It 
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consists  of  n  functional  compartments  for  the  column,  each  con¬ 
taining  two  subcompartments,  one  representing  the  pellet  phase  and 
the  other  the  liquid  phase.  The  two  subcompartments  in  each 
compartment  are  connected  by  diffusion  while  the  main  compartments 
are  connected  via  unidirectional  transport  (convective  flow)  be¬ 
tween  the  liquid  phase  subcompartments  (see  Figure  1.3). 


Figure  1,3 
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Defining  variables  as  follows  (all  are  scaled  and  dimensionless): 

r^t)  =  substrate  concentration  in  liquid  phase  in 
compartment  i  at  time  t, 

s . (t)  =  substrate  concentration  in  pellet  phase  in 
compartment  i  at  time  t, 

P^Ct)  =  product  concentration  in  liquid  phase  in 
compartment  i  at  time  t, 

q^ (t)  =  product  concentration  in  pellet  phase  in 
compartment  i  at  time  t, 

one  can  write  mass  balance  equations  to  obtain  a  model 

-ar^t)  -  «{r1(t)  -  s^t-x^}  +  u(t) 
a{ril(t-x)  -  ri(t)>  -  0(ri(t)  -  si(t-t1)},  i  >  1, 

-PF(si(t))  +  ve{ri(t)  -  si(t-t1)},  i  >  1, 

-otp1  (t)  -  B(p1(t)  -  q1(t-x2)} 

°{pi.1(t-t)  -  pi(t)}  -  B(pi(t)  -  qi(t*t2)},  i  >  1, 

PF(s.(t))  +  vg{p.(t)  -  q.(t-x2)},  i  >  1. 


3t" 
dri  _ 

ar~  ' 

ds^ 
XT  = 

dp, 
Xt~  * 

5 £i  . 

dt 

dq^ 

dt 
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Here  F  is  a  nonlinear  reaction  velocity  term,  the  delays  z>Ti>T2 
are  transport  times  between  compartments  i-1  and  i,  between  pellet 
interior  and  liquid  region  for  substrate,  and  between  pellet  interior 
and  liquid  region  for  product,  respectively.  The  term  u  represents 
input  of  substrate  to  the  liquid  subcompartment  of  compartment  1. 

The  parameters  a,3,P,v,B  are  all  related  to  biochemical  and  physical 
constants  for  the  column  configuration.  For  example,  3  =  NDs£/eV 
where  N  is  the  "apparent"  number  of  pellets  per  compartment, 

V  =  volume  of  liquid  per  compartment,  Ds  =  coefficient  of  diffusion 
for  S  within  the  pellet  region,  e  =  "thickness"  of  the  model 
pellet  region,  and  £  =  effective  surface  area  per  pellet. 

Using  data  collected  from  a  number  of  specific  experiments 
performed  with  tracer,  product,  and  substrate  inputs,  one  wishes  to 
determine  values  of  T  » Ti  »x2  »  p  *  so  t*iat  the  model  describes 

accurately  the  operation  of  the  column.  Once  this  is  done,  the 
model  then  must  be  used  to  design  (optimal)  control  procedures  for 
the  column. 

We  thus  have  classical  identification  and  control  problems 

T 

for  systems  (let  xA  =  (ri  ,si  ,Pi  ) 


*i(t)  =  AQ(Y)xi(t)  +  A1(Y)xi(t-T1)  +  A2(Y)xi(t-x2) 

♦  f (Y,xi_1(t-r) ,  xi(t),u(t)) 

where  the  delays  an(*  vector  parameter  Y  (involving 

only  coefficients)  are  to  be  identified. 


P  •’Tf’ 
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1.2.  Identification  problem  for  hereditary  systems 
in  unsteady  aerodynamics 

We  consider  next  an  interesting  class  of  identification 
problems  which  arise  in  the  study  of  unsteady  aerodynamics  (see  [  2  ]). 
Consider  a  thin,  flat  airfoil  mounted  on  springs  as  shown  in 
Figure  1.4  in  a  region  where  we  have  fluid  (air)  flow  with  un¬ 
disturbed  stream  velocity  U  (in  the  x-direction) .  Flow  around 
the  airfoil  is  disturbed  and  we  assume  it  has  velocity  q  =  (u,w). 

Laws  of  conservation  of  mass  and  momentum  lead  to  a  system  cf  partial 
differential  equations  for  the  fluid  velocity  components  u  and  w. 
Assuming  incompressible  flow  we  have  the  continuity  equation 
V*q  =  0.  Elementary  hydrodynamics  also  yield  that  curl  (q)  =  0 
from  which  we  deduce  the  existence  of  a  velocity  potential  tp  so  that 
q  =  V(p.  The  equation  of  continuity  then  becomes  A<p  =  0 .  We 
restrict  our  considerations  to  small  motions  of  the  airfoil  so  that 
a  linearized  theory  may  be  adopted.  We  assume  that  cp  is  given  by 

ip(x,z,t)  =  tp(x,z  ,t)  +  Ux 

where  ip  is  a  disturbance  potential.  It  follows  that  <p  must 
satisfy 

(1.1)  Aip  =  0. 

In  addition  one  has  the  (flow  tangency)  boundary  conditions 


— -1 — 
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(1.2)  !“  (x,0,t)  =  w(x, 0 ,  t)  =  wa(x,t)  -1  <  X  <  1 

where  w  is  a  given  function  describing  the  motion  of  the  airfoil. 

We  here  assume  that  the  airfoil  is  a  thin  plate  located  at  z  =  0, 

-1  <  x  <  1  as  depicted  in  Figure  1.  5. 

After  arguments  involving  a  conformal  mapping  of  the  airfoil 

into  the  unit  circle  and  the  introduction  of  sources  (elementary 

flows  along  radial  lines)  and  vortices  (elementary  flows  along 

concentric  circles),  me  finds  that  a  solution  of  (1.1), (1.2)  for  the 

disturbance  potential  <p  consists  of  an  appropriate  collection  of 

sources  distributed  along  the  airfoil  and  any  weighted  combination  of 

"compatible"  vortex  pairs.  A  "compatible"  pair  consists  of  one 

vortex  on  the  airfoil  at  r  =  r^  <  1  and  an  oppositely  rotating  one 

at  r  =  —  >  1.  Compatible  pairs  induce  a  flow  with  finite  angular 
rl 

momentum  and  with  fluid  velocity  that  is  tangent  to  the  airfoil.  The 

required  distribution  of  sources  is  uniquely  defined  by  the  airfoil 

motion  (w  (x,t)  in  equation  (1.2))  but  the  distribution  of  vortices  in 

the  wake  given  by  a  density  function  Yw(5,t)  is  as  yet  unknown. 

In  lieu  of  y  we  introduce  a  new  function  T  termed  the  circulation, 
w 

For  brevity  we  shall  "define"  F  by 

(1.3)  f(t  -  li^)  =  -  VS,t) 

with  the  boundary  condition  r(-°°)  =  0.  This  relationship  reveals 
that  vorticity  in  the  wake  at  time  t  and  position  5  was  produced 
by  a  change  in  the  circulation  at  an  earlier  time,  i.e.  an 
hereditary  phenomenon  is  involved.  In  integrated  form  (using 


Figure  1.5 


Figure  1,6 


r(-»)  =  o)  this  becomes 


(1.4)  r(t)  -  J  Yw(5,t)dC  =  0. 

To  determine  r  (or  y^)  we  impose  an  additional  hypothesis, 
viz.  finiteness  of  the  fluid  velocity  at  the  trailing  edge  of  the 
airfoil.  Mathematically  this  is  written 

(1.5)  v(t)  +  /U  Yw(5,t)de  =  0. 

Here  v  is  the  contribution  to  the  velocity  due  to  the  source  distri¬ 
bution.  Subtracting  (1.5)  from  (1.4)  and  using  (1.3)  we  thus  obtain 

,00 

(1.6)  r(t)  =  v (t)  ♦  J  f(or(t  -  -^^-)d5 

where  f(5)  =  /(5  +  1)  ./(g-1)  -  1.  This  finally  is  our  model  equation 
(see  [  28],  p.  292)  that  is  the  basis  of  hereditary  models  in  un¬ 
steady  aerodynamics. 

A  simple  change  of  variables  €  =  1  -  o  in  the  integral  in  (1.6) 
yields  the  equation 

,0  „ 

(1.7)  r(t)  =*  v(t)  +  f (o)r(t  +  g)do 

J  -OO 

where  f(a)  =  f(l-a).  This  is  essentially  a  neutral  functional 
differential  equation  with  infinite  memory.  Among  the  numerous 
approximations  made  in  the  derivation  of  such  a  model  is  the 
expression  for  f 
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Cl. 8)  f(5)  -  /(C+l)7CE-'U  -  l. 


It  turns  out  that  the  transverse  velocity  component  w  exhibits  a 
boundary  layer  phenomenon  as  sketched  in  Figure  1.6  where  r  is  the 
horizontal  distance  from  the  trailing  edge  of  the  airfoil.  Thus, 
the  expression  for  f  in  (1.8)  is  valid  only  for  C  >>  1.  To  better 
approximate  this  phenomenon  in  (1.7)  one  might  approximate  f  by  a 
function  g  having  the  form 


r 


g(°;a,M) 


ao 

e?(o) 

V. 


-p  <  o  <  0 
-oo  <  a  <  -  P 


when  it  is  understood  that  cx ,  g  must  be  chosen  so  that  g  is 
continuous  at  a  =  -p.  The  model  then  is  given  by 


(1.9) 


r(t)  =  V(t)  +  g(a;«,e,u)r(t  + 


£)da. 


Assuming  smoothness  of  g  we  formally  integrate  by  parts  the 
integral  in  (1.9)  to  obtain 

r(t)  =  v(t)  -  g(c)r(t  +  g)do 

J  -00 

or  letting  s  =  t  +  ^  in  the  integral  and  defining  G(£)  =  g(US), 
we  have 


(1.10) 


ret)  =  v(t) 


l 


t 

G(s-t)T(s)ds. 

-00 
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Equation  (1-1°)  is  a  retarded  FDE  with  infinite  memory  which,  upon 
differentiation,  yields  the  more  familiar  form 


r(t)  =  v(t) 


G 


(O)r(t)  +  | 


t  . 
G( 


-00 


s- t)T (s)ds . 


In  practice,  one  often  would  desire  to  replace  the  integral  term  by 
a  finite  integral 


(s-t)r(s)ds 


in  which  case  one  obtains  an  equation  Ctaking  TCt)  =  xCt),  observing 
that  G(0)  =  g(0)  =  «  and  identifying  v(t)  =  u(t)  as  the  input) 


(1.11)  *Ct)  =  -°x(t)  + 


J  -t 


Ug(U[s-t]  ;<*,$, h)x(s)ds  +  u(t) 


An  important  identification  problem  then  consists  of  making  observa¬ 
tions  corresponding  to  an  input  u(t)  =  v(t)  and  using  these  to 
estimate  the  parameters  a,g,p,  and  x  so  that  the  model  yields  a 
sufficiently  accurate  description  of  the  aerodynamic  phenomena  under 
investigation. 


The  fundamental  identification  problem  for  delay  systems 
We  consider  in  this  paper  n-vector  systems  of  the  form 


(2.1)  x(t)  -  L(q)xt  +  B(a)u(t),  t  >  0, 

with  initial  data 

(2.2)  x(0)  =  n,  x0  =  4>,  (n,<J>)  £  Rn  *  Lj(-rf0), 


and  output 


(2.3)  y  (t)  =  C(a)x(t)  +  D(a)u(t). 


We  make  the  following  definitions  and  assumptions  about  the 
operators  and  parameters  in  (2.1) -(2. 3).  There  exists  a  fixed 
given  r  >  0  and  compact  convex  set  n  «=  and  we  define  the 
compact  convex  set  Q  c  Ry  +  V  by  Q  s  ft  *  & ,  where 


{ h  -  (rx,r2, 


r .  < 
i  - 


ri+l  5  r’ 


=  1 , . . .  ,v-l} 


For  a  function  x  we  adopt  the  usual  notation  xt(6)  =  x(t+9). 

For  a  given  element  q  =  (a,h)  in  the  admissible  parameter  set  Q, 
we  define  the  operators  L(q):  L2(-r,0)  Rn  of  (2.1)  by 


(2-4)  L(q)4> 


v 

I  A.(a)H-ri) 
i  =  0  1  1 


KC°»9)(K8)d9 

v 


where  rQ  e  0,  and  for  each  ot  £  n,  A. (o) ,B(a) ,C(a) ,  and  D(«)  are 
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n  x  n,  n  *  m,  k  x  n,  and  k  x  m  matrices  respectively.  We  assume 
that  the  n  x  n  matrix  valued  function  0  -*■  K(a,8)  is  in  l^C-r.O) 
and  that  the  functions  A^,B,C,D,K(  ,  •)  are  continuous  in  a. 

Remark  1:  In  (2.4)  one  must  give  the  proper  interpretation  to 
point  evaluations  in  the  event  <t>  is  only  an  "function". 

Since  in  (2.1)  we  are  interested  in  integrals  of  the  system,  the 
usual  interpretation  is  intended  here  (see  [  1 o ]  for  a  more  detailed 
discussion) . 

We  further  assume  that  we  are  given  an  initial  data  set 
5/  e  Rn  x  L^C-r.O)  that  is  closed,  bounded  and  convex  and  we 
define 


r  =  .S^xQ  =.yxnx#' 

as  our  admissible  initial  data  -  parameter  set.  Elements  Y  in  r 
will  be  denoted  in  one  of  several  ways  throughout  our  discussions 
below: 


Y  =  (h»^*q)  =  (h,<l> ,« ,h)  =  ( n fa , r j , . . . , r v) 

where  q  =  (a,h)  =  (a ,r^ . rv) .  For  each  Y  =  (n,4>,q)  in  r 

we  shall  denote  the  output  to  (2.1) -(2. 3)  at  time  t  >  0  by 
y  =  y(t;  Y) . 

Identification  of  the  system  variables  Y  in  (2.1)- (2. 3)  is 


based  on  input-output  information.  Given  a  piecewise  continuous 
control  input  u  defined  on  some  time  interval  [0,T],  one  samples 
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the  system  at  times  { t ^ ,  0  <  t^  <  tj  <  ...  <  t^  <  T,  to  obtain 
observations  { y ^ €  R^,  i  =  1,2,...,M.  One  can  then  perform 
a  least  squares  fit  to  data  (or  seek  a  maximum  likelihood  estimator 
for  y) •  Formally,  we  may  state  this  as  follows: 

Problem:  Given  the  input  u  and  observations  {y^}  at  times  { t - } , 

it  it  it  it 

find  Y  =  (b  ,$  ,q  )  in  T  which  minimizes  the  fit  error 

(2-5)  J(Y)  =  \  l  |  y  (t  -  ;  Y)  -  yj2. 

1  i=l  1  1 

Remark  2:  Whenever  r*  <  r,  one  only  needs  <l>*  defined  on 
[-r*,0]  in  order  to  obtain  a  solution  to  (2.1)- (2. 3)  (in  practice, 
this  is  exactly  what  we  shall  obtain).  However,  we  can  view 
(1*,<I>*)  as  an  element  of  S/  by  making  a  simple  (arbitrary  but 
definite)  backward  extension  of  to  all  of  [-r,0J. 


\ 

f 


Let  r  >  0  be  fixed  and  given  as  in  the  previous  section 
and  define  Z  =  Rn  *  L^C-r.O).  For  q  =  (<*,h)  €  Q  and  (n,4>)  €  Z, 
define  for  t  >  0  the  mappings  S(t;q):  Z  -*■  Z  by 

S(t;q)(n,4>)  =  (xCt;y)  ,xt(Y)) 

where  x  is  the  solution  to  (2.1)  with  u  =  0  and  xt(0)  =  x(t+0), 
-r  <  0  <  0.  It  is  easily  verified  that  for  each  q,  (S(t;q)}t>0  is 
a  strongly  continuous  semigroup  of  linear  operators  on  Z.  Further¬ 
more,  one  finds  [  5  ]  that  the  infinitesimal  generator  j^Tq)  ,  with 
domain 

^C^(q))  •  9  •  ( (n,4>)  e  z|  +  e  w^^-r.o),  n  =  ♦  (<>)>, 

is  given  by 

-artq)  (♦«>),♦)  =  (LCq)*  ,♦)  - 

We  note  that  for  q  €  Q,  q))  does  not  depend  on  q  itself. 

However,  for  k  >  1,  (q))  does  depend  on  q.  For  example, 

^0^2(q))  -  UH0),*)  |  ♦  €  W^2)(-r,0),  <P  (0)  =  L(q)<H. 

If  we  define  the  operators  B(a):  Rm  ■+■  Z  and  6(a):  Z  -*•  R^ 
by  §(a)u  =  (B(a)u,0)  and  6(a)  (n,$)  =  C(a)n,  then  the  delay 
system  (2.1) -(2. 3)  is  formally  equivalent  to  the  abstract  ordinary 
differential  equation  (ODE)  system 
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(2.6)  z(t)  =  j*(q)z(t)  +  B(a)u(t),  t  >  0, 

(2.7)  z (0)  =  (0,0) 

(2.8)  y(t)  =  C(a)z(t)  +  D(<*)u(t). 

As  in  the  usual  theory  dealing  with  semigroups  and  abstract 
differential  equations,  a  mild  solution  to  (2.6)-(2.8)  can  be  given 
by  a  variation  of  parameters  formula.  Specifically,  (2. 6) -(2. 7) 
has  the  mild  solution  z(t)  «  z(t;Y,u)  given  by 

(2.9)  z(t)  =  S(t;q)(o,0)  +  [  S(t-o;q)B(ct)u(o)do. 

J  0 

It  is  a  happy  circumstance  that  (2.9)  is  actually  equivalent  to 

(2.1) - C2 . 2)  in  a  strong  sense  as  we  now  state  precisely  (for  proof 
see  [  4  ] ,  [  5  ]  or  [  6  ]) . 

Theorem  2.1.  Let  x(-;Y,u)  denote  the  solution  to  (.2.1)  -  (2.2) 
corresponding  to  Y  €  Z  *  Q  and  u  E  L2  loc'  Then,  for  all  t  >  0 

z (t ; Y,u)  =  (x(t;Y,u) ,xt(Y,u)) . 

In  view  of  the  above  equivalence  results,  the  I.D.  problem  for 

(2.1) -  ('2. 3)  posed  above  can  be  reformulated  in  terms  of  an  abstract 

I.D.  problem.  That  is,  given  input  u  and  observations  {y^}  at 
times  find  y*  a  (n*,0*,q*)  in  T  so  as  to  minimize  J(Y) 

as  given  in  (2.5)  when  now  y(t)  is  given  by  (2.8)  and  (2.9)  in 


U 
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place  of  02.3).  Whether  the  problem  is  formulated  in  terms  of 

(2.8),  (2.9)  or  (2.1)-(2.3),  it  is  clear  that  we  are  dealing  with 

I.D.  problems  involving  infinite  dimensional  state  systems. 

Formulation  in  the  framework  of  the  Hilbert  space  Z  only  emphasizes 

this  and  is  in  no  way  an  essential  factor  in  the  infinite 

dimensionality  (and  the  associated  difficulties)  of  the  problem. 

Our  main  interests  here  are  identification  schemes  that  will  result 

in  computationally  efficient  algorithms.  The  approach  we  take  is  a 

classical  one  of  the  Ritz  type.  We  shall  choose  a  sequence  of  finite 

dimentional  problems,  each  of  which  is  defined  on  a  finite  dimensional 

state  space  and  approximates  the  original  I.D.  problem  in  Z. 

By  appropriate  choices  of  the  sequence  {X^}  and  the  corresponding 

approximating  problems,  we  hope  to  obtain  a  sequence  of  more  easily 

N  m  M  N 

solved  problems  with  solutions  Y  *  On  ,<j>  ,q  )  which  converge  to 

a  solution  y*  of  our  original  problem. 

Fundamental  to  this  endeavor  is  the  convergence  of  the 

underlying  approximating  systems  to  the  original  system  (2.9).  Our 

formulation  in  a  functional  analytic  framework  will  allow  us  to 

utilize  abstract  approximation  theorems  from  semigroup  theory 

(e.g.,  see  [  5  ]).  The  problems  here,  however,  are  a  little 

different  from  the  control  problems  of  [  5  ]  where  one  chooses  a 

N 

sequence  of  subspaces  Z  c  Z  on  which  to  solve  approximating 
control  problems.  The  I.D.  problems  to  be  treated  below  pose  some 
additional  difficulties  in  that  for  each  value  of  N,  the  "state" 
space  changes.  That  is,  the  natural  space  for  (2.9)  with 
qN  -  (<*N,rJ,. . .  ,rj)  is  ZN  =  Rn  *  L^-r^O)  which,  in  addition 
to  varying  with  N,  is  not  a  subspace  of  the  original  space 
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Z  =  Rn  x  L^C-r.O).  The  approximating  spaces  XN  clearly  should  be 
chosen  so  that  c  Z^. 

There  are  abstract  approximation  theorems  (motivated  by 

differencing  schemes  for  partial  differential  equations  and 

applications  from  probability  theory)  available  in  the  literature 

in  the  case  where  Z.  For  example,  the  original  Lax,  Trotter, 

Kato  efforts  [20],  [26],  [17]  resulted  in  such  theorems  as  did  the 

later  efforts  of  Kurtz  [19].  However,  all  of  these  versions  of 

the  approximation  results  (and  all  others  with  which  we  are  familiar) 

require  the  spaces  XN  to  approximate  Z  in  the  sense  that  there 

exist  projection-like  mappings  PN:  Z  XN  which  satisfy  a  norm 

convergence  criterion  |PMz|  -*•  |  z  |  _  as  N  00 .  For  the  problems 

N  XN  L 

and  approximations  we  shall  discuss  below  such  a  criterion  is  not 
met  (in  general,  one  will  not  have  r^  -*■  r,  where  r  is  the  a 
priori  chosen  upper  bound  for  the  hereditary  effects  in  the  systems). 
We  shall,  therefore  be  obligated  to  state  and  prove  an  appropriate 
version  of  the  abstract  approximation  results  and  this  is  done  in 
the  next  section.  The  arguments  used  to  establish  this  theorem  are 
very  similar  to  the  standard  ones  found  in  the  literature.  One  has 
a  sequence  of  approximating  infinitesimal  generators  (i.g.'s)  AN 
which  converge  in  some  sense  to  an  i.g.  A.  This  convergence  is 
sufficient  to  imply  convergence  of  the  resolvents  R^(AN)  to  R^ (A) . 
These  are  the  Laplace  transforms  of  the  corresponding  semigroups 
SN(t),S(t)  respectively  and  their  convergence  is  enough  to  guarantee 
the  desired  convergence  S^(t)  ■+  S(t).  We  make  this  more  precise 
in  the  next  section. 


3 .  An  abstract  approximation  theorem 

Let  Z  and  Z^,  N  =  1,2,...,  be  Hilbert  spaces  with  norms 

|*|  and  I’lfl  respectively.  Let  be  a  closed  subspace  of 

and  Z^  -*■  be  the  canonical  projection  of  ZN  onto 

XN  along  Xj^.  Suppose  Z  -*■  Z^  is  a  mapping  satisfying 

Im(>N)  =  ZN  and  |  >Nz|N  <  |z|  for  z  £  Z.  Finally  define 

PN:  Z  XN  by  PN  =  (In  our  discussions  for  the  I.D. 

problem  above,  Z  -  Rn  *  Lj(-r,0),  ZN  =  Rn  *  LJ(-rJ[,0)  ,  XN  is  an 

approximating  space  such  as  the  AVE  spaces  of  [  5  ]  or  the  spline 

spaces  of  [  10  ]  -  these  will  be  discussed  fully  below.  Finally, 

is  the  operator  that  takes  z  =  in  Z  into  z  =  (n,<t>) 

where  is  the  restriction  of  <J>  to  [-r^,0].  We  note  that  in 

this  case  we  would  not  expect  to  have  |PNz|N  •*  |z|  for  z  6  Z 
N 

unless  rv  -*■  r  and  itself  has  certain  convergence  properties.) 

We  adopt  the  following  standard  notation  for  the  presentation 
of  our  fundamental  approximation  results.  For  a  Hilbert  space  X, 
we  write  B  €  G(M,0)  to  mean  B:  ^(B)  c  X  X  is  the  i.g.  of  a 
Cg-s.g.  (T(t)}  satisfying  |T(t)|  <  Me^t.  We  also  denote  the 
resolvent  (Xl-B)  1  by  RX(B)  and  recall  that  R^(B)x  = 

f  e'X°T(o)xda. 
j0 


Theorem  3.1.  Let  Z,Z^,X^,  and  P^  be  given  as  above.  Suppose 
for  some  M,0  we  have  AN  e  G(M, B)  on  XN  and  A  €  G (M, B)  on  Z. 
Further  suppose  there  exists  @  c  ^(A) ,  @  dense  in  Z  such  that 
(i)  Rx  (A)S>  c  9  for  Re  X  >  0  > 

(3.1) 

(ii)  for  every  z  6  jA^P^z  -  P^Az|^  0  as  N  -*•  00 . 
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Then  for  every  z  €  Z 

(3.2)  |SN(t)PNz  -  PNS(t)z|N+  0  as  N  +  ~, 

and  the  convergence  is  uniform  in  t  on  compact  intervals.  Here 
is  the  i.g.  for  SN(t) ,  A  the  i.g.  for  S(t). 

Remark  3.1.  Implicit  in  the  statement  and  proof  of  Theorem  3.1  is 

the  assumption  that  P^z  £  i^(A^)  for  every  z  £  Z.  In  our  use 

of  the  theorem  for  I.D.  schemes  below,  XN  will  be  finite 

dimensional  and  £^(AN)  =  X„.  Indeed,  we  shall  find  A^  bounded 
ANt 

with  SN(t)  =  e  . 

Proof :  Let  X  be  fixed  throughout  with  Re  X  >  8  so  that 
Rx (An) ,  Rx (A)  exist.  We  first  establish  that  for  every  y  £  Z 

(3.3)  |Rx(AN)PNy  -  PNRx(A)ylNl  Re  X-g  l^ANPN  "  PNA-*RX  I N* 

From  the  definition  of  the  resolvent  operator  we  have  for  any 
operator  B 

RX(B)B  =  BRX(B)  *  XRX(B)  -  I. 

In  particular 


WANPN  =  XMAn)PN  '  PN 

pnarxca)  =  xpnrx(a)  -  pn 


it 


’"■sra 
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so  that 

MVWx<A>  -  Rx(WRx<A>  ■  raCan>pn  '  pnra  <a>  • 

Hence  for  any  y  €  Z  we  have 

I  MW  - 

IRxCAn)[AnPn  -  PNA]Rx(A)y|N 
-  Re  X- 6  l^ANPN  "  PNA)Rx(A)ylN’ 

the  last  inequality  following  from  the  fact  that  AN  €  G(M,3). 

Next,  for  given  z  €  &  where  9  is  as  in  the  hypotheses, 

define 


fn(°)  5  SN(°JPNZ  '  PNS^:- 

Then  from  (i),  (ii)  and  (3.3)  we  conclude  rh.it  tor  Re  X  >  3, 

I^xIfnHn  *  0  as  s  *  " 

where  Jz^  is  the  Laplace  transform.  We  observe  that  from  the 
bounds  on  SN’PN,S  the  sequence  { FN )  is  uniformly  exponentially 
bounded,  i.e. 

|Fn(o)|  <  2Mee°|z|. 

Finally,  since 
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jY  PnS(t)z  =  PnS(t)Az 


and 


d 

dr 


sn(x)anpnz’ 


a  simple  quadrature  reveals 


(3.4)  Fn(o)  =  |a[SN(T)ANPNz  -  PNS(T)Az)di 


and  it  follows  that  (FN>  is  a  pointwise  equicontinuous  family  on 
[O,00).  (From  the  convergence  in  (3.1)-(ii)  and  the  bounds  on 
and  S,  one  easily  verifies  that  the  integrand  in  (3.4)  is  uniformly 
exponentially  bounded.)  We  are  thus  in  a  position  to  use  a  lemma 
due  to  Kurtz  (Lemma  2.11,  p.  359  of  [19])  to  conclude  that 
|Fn(o)|  ->-0  as  N  -*■  uniformly  on  compact  intervals.  (Actually, 

the  lemma  as  stated  by  Kurtz  requires  uniform  boundedness  of  (FN> 
but  a  careful  inspection  of  his  proof  will  convince  the  reader  that 
this  requirement  can  be  replaced  by  uniform  exponential  boundedness 
as  we  have  here.) 

We  thus  obtain  the  desired  convergence  (3.2)  at  least  for  each 
z  €  2  .  But  then  standard  density  arguments  (the  triangle 

inequality,  bounds  for  S^.P^,  and  the  density  of  <2>  in  Z)  can  be 
employed  to  establish  the  convergence  for  all  z  6  Z. 


Remark  3.2.  We  note  that  in  the  above  theorem  we  could  have 
hypothesized  A^  €  G(M,B)  on  Z^  instead  of  on  XN  without 
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altering  the  proof.  However,  in  the  applications  we  have  in  mind 

Avjt 

we  wish  to  obtain  invariancy  of  SN(t)  =  e  on  XN  (the  space 
where  our  approximating  systems  will  be  defined  and  used).  Thus, 
if  we  posit  An  €  G(M,3)  on  ZN  we  must  add  the  additional 
hypothesis  Im(AN)  <=  c  ^(AN)  in  order  to  use  the  approximation 
result  as  we  desire  below. 

Remark  3.3.  One  can  clearly  choose  (with  then  the 

identity  on  ZN  or  Z^  =  Z  (and  the  identity  on  Z)  and 

obtain  other  versions  of  the  approximation  results.  Again,  our 
choice  here  is  dictated  by  the  application  to  be  discussed  below. 

Remark  3.4.  In  the  event  one  has  ZN  =  XN  c  z  and  P^:  Z  ZN 
satisfying  PNz  -*•  z  for  z  €  Z,  then  the  condition  (3.1)-(ii)  can 
be  replaced  by  |AnPnz  -  Az  |  -*-0  and  the  conclusion  (3.2)  by 
|SN(t)PNz  -  S(t)z|  -*•  0.  This  then  is  essentially  the  version  of 
the  approximation  theorem  that  we  employed  in  previous  efforts  dealing 
with  control  problems  [  4  ],  [  5  J. 

6 

Corollary  3.1.  Suppose  the  convergence  in  (3.1)-(ii)  is  0(N  ) 

whenever  z  has  the  form  R^(A)y,  S(t)R^(A)y,  and  S(t)R^(A)y, 

X  >  3,  for  a  given  y  €  Z.  Suppose  further  that  the  constants  in 
•  6 

0(N  )  are  uniform  in  t  in  the  latter  two  cases.  Then  the 

convergence  in  (3.2)  is  also  0(N  )  whenever  z  =  R^(A)y  for 

this  y. 
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Proof:  Using  rather  standard  arguments  (p.  87,  [  22})  one  finds 
that 

H5  [SNCt-0)RxCAN)PNSC0)RJ(A)x] 

-  SNCt-o)[PNRx(A)  -  rxCAn)Pn]S(<J)x 
for  arbitrary  x  €  Z  and  X  >  g.  Hence,  we  have 
(3.5)  J*SN(t-a)[PNRx(A)  -  Rx(AN)PN]S(a)xdo 
*  Rx(AN)(PNS(t)  -  SN(t)PN]Rx(A)x. 

Using  (3.5)  and  (3,3)  we  have  for  any  y  £  2 

llsNCt)PN  -  PNS(t))R2x(A)y  lN 

<  |SN(t)[PNRx(A)  -  Rx(AN)PN]Rx(A)y|N 

+  i  WtSN(t)PN  '  PNSCt)]Rx(A)y|N 

+  UWPN  '  PNRXCA)]S(t)Rx(A)yiN 

<  Me6tUPNRx(A)  -  RxCAN)PN]Rx(A)y|N 

+  jo,SNCt'°),,{PNRXCA)  '  RA^AN)PN}S(a)y{NdcJ 
+ 


Urx(An)Pn  -  PNRx(A)}S(t)Rx(A)yjN 
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5  MeM  A  1  CPnA  -  ANPN)Rj(A)y|N 

*  |*Me6Ct'0)  A  l(PNA  -  ANPN)S(a)Rx(A)y|N  do 

*  A  ICPmA  -  ANPN)SCt)Rj(A)y|N. 

Thus,  if  y  is  chosen  as  in  the  hypothesis  of  the  corollary,  the 
conclusion  follows  immediately. 

? 

Theorem  3.2.  Let  &  <=■  )  satisfy  the  following: 

(i)  For  each  z  €  &  there  exists  k  =  k(z)  such  that 

|(AnPn  -  PnA)z|n  <  k/N6,  N  =  1,2 . 

(ii)  There  exists  ^  c  &  such  that  z  €  2^  implies 

(a)  S(t) z  €  j2,  0  <  t  <  T, 

(b)  S(t) (AI-A)z  £  x>e,  0<t<T 

and  furthermore  the  constants  guaranteed  by  (i)  for  (a) ,  (b)  can  be 
chosen  independent  of  t. 

Then  for  z  €  <0^  we  have  there  exists  k(z)  such  that 
|[SN(t)PN  -  PNS(t)]z|N  <  k(z)/N6 


for  0  <  t  <  T. 
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Proof:  Let  z  €  ^  and  define  y  =  (Ai-A)2z.  Then  R2(A)y  =  z. 
Furthermore,  by  (ii)  we  have  SCt)R2CA)y  e  .0  and  S(t)Rx(A)y  €  2 
with  the  constants  in  the  0(N’  )  estimates  uniform  in  t.  It 
follows  that  the  hypotheses  of  Corollary  3.1  are  satisfied  and 
hence  we  reach  the  desired  conclusion  since  z  =  R2(A)y. 
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4 .  Identification  schemes  for  delay  systems 

Let  it®,!!1  be  the  coordinate  projections  of  Z  =  Rn  *  L^f-r.O) 
onto  Rn,  L^f-r.O)  respectively.  We  recall  that  the  I.D.  problem 
of  §2  can  be  written: 

(  :  Given  input  u  and  observations  {y^}  at  ^i^j^l’ 

find  Y*  =  (h*,4>*,q*)  in  T  so  as  to  minimize 

i  M  .  7 

J(y)  *  J  l  ly (t. ;y)  -  y.  | 
z  i=l  1  1 

where  y  is  the  solution  of  (2.8),  (2.9).  That  is, 

y(t; Y)  =  C(ot)z(t;Y,u)  +  D(a)u(t) 

=  C(ot)ir°z(t;Y,u)  +  D(°)u(t) 

=  C(a)x(t ;  Y,u)  +  D  (a )  u  ( t ) 

Thus  the  identification  problem  can  be  viewed  in  a  state- 
space  Z,  parameter  space  r  setting.  This  will  lead  to  a  sequence 
of  approximate  I.D.  problems  if  we  approximate  Z  by  a  sequence 
of  spaces  X^. 

We  henceforth  assume  that  we  have  in  hand  a  sequence 
{r^},  0  <  r^  <  r,  and  take  closed  XN  c  ZN  =  Rn  x  L^-r^O).  Then 
we  define  the  operator  Z  •+■  Z^  as  the  operator  that  truncates 

irz  to  the  interval  [-r”,0]  and  then  denote  by  the  Moore- 

Penrose  [21  ]  pseudo  inverse  Z^  -*■  Z.  In  this  case  if 

+ 

z  =  (n.’IO  €  ZN,  then  J^N(n,t|0  =  (n,<)>)  where  <|>  =  <P  on 
[-r^,0],  <j>  =  0  on  [-r,-r^). 
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Let  {SN(t)>  be  a  Cg-semigroup  on  XN  with  i.g. 

An  €  G(M,8)  and  let  PN:  Z  -*■  XN  be  as  given  in  §3.  We  define 
rN  C  XN  x  Q  by  =  (P^j^)  x  Q  where  S/1  is  the  given  initial 
data  set.  We  make  the  following  standing  assumption  on  Sf  and  Q. 

•J* 

Assumption  4.1.  Q  and  Sf  are  compact  and  c  Sf 

for  all  N. 


From  this  assumption  and  the  continuity  of  PN,  we  readily 

N  N  N 

obtain  that  TN  is  compact.  Given  Y  =  (zQ,q  )  € PN  we  define 

(4.1)  zN(t ; YN,u)  5  SN(t)zJ  ♦  |%N(t-o)PNB(aN)u(a)do 

and 


(4.2)  yN(t; YN)  =  C(aN)zN(t;YN,u)  +  D(aN)u(t) 


«  C(aN)Ti0zN(t;YN,u)  +  D(aN)u(t). 


The  approximate  I.D.  problems  are: 


a.  _ M 

( :  Given  input  u  and  observations  {y^},  find  Y  €  rN  so 
as  to  minimize 


M 


JN(YN)  e  I  |yN(t.;YN)  -  y±\ 
i  =  l  1  1 


We  note  that  solutions  to  (  ^)  exist  since  the  mapping 
N 

S  -*■  J  (5)  is  continuous  on  and  is  compact. 
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— N  — N  — N 

Theorem  4.1.  Suppose  Y  *  (z^.q  )  is  a  sequence  of  solutions 
to  the  problems  ( ^ )  and  that  there  exists  7  €  r  such  that 
7N  7  in  the  sense  (a)  qN  -*•  q  in  RW+V,  (b)  -^Zq  -*■  in  Z. 
Suppose  further  that  AN  =  A^(q  ) ,  A  =  A(q)  satisfy  the  conditions 
and  hypotheses  of  Theorem  3.1.  Then 

|PNz(t;7,u)  -  zN(t;7N,u)|N  ■+  0  as  N  », 

uniformly  in  t  on  compact  intervals,  where 

ft 

(4.3)  z(t;Y,u)  =  S(t)zQ  +  j  S(t-o)B(a)u(o)do. 

Proof :  From  the  hypotheses  and  Theorem  3.1  we  have  immediately  that 

|SN(t)PNz  -  PNS(t)z)N  0,  uniformly  on  compact  intervals,  for  all 
z  £  Z.  Therefore 

|SN(t)z0  -  PNS(t)z0lN  =  |SN(t)PN-^NZp  -  PNS(t)z0lN 

-  I  ‘  PNZ0^N  +  lSN(t^PNZ0  '  PNS(-t^ZolN 

-  Me&  l^z0  "  Z0l  +  I  SN(-t-)PNZ0  “  PNS  ^^0 1 N  * 


\ 

1 


The  first  term  approaches  0  by  (b)  as  does  the  second  from  our 
preceding  remark.  Next,  consider 
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IN 


Ct-cr)P..BCaN)uCo)do  -  PM  I  S(t-a)BO)u(o)do 


N 


N  J 


N 


^  _ M 

r  n  Y 


ft 

=  [SNCt-o)PNBC“,',)uCo)  -  PjsjS  C t -  o)B(a)u(a)  ]do 


N 


<  |jSN(t-a)PN[B(aN)  -  B(a)]u(°)|Ndc 

+  J4!  CSNCt:-«)pN  '  PNSCt-a)]B(a)u(a)|Nda. 

The  results  of  Theorem  3.1,  the  continuity  of  B,  and  dominated 
convergence  yield  convergence  of  these  terms  to  0,  uniformly 
in  t.  The  desired  conclusion  follows  immediately  from  these 
estimates . 


Corollary  4.1.  Suppose  P^:  Z  -*•  XN  satisfies 


N 


(4.4)  ti°(Pnz)  -*■  in  Rn  for  each  z  €  Z. 


Then  under  the  assumptions  of  Theorem  4.1  we  have 


yN(t;yN)  •*  y(t;Y)  for  each  t, 


Proof:  Recall 


y(t;Y)  -  C(o)z(t;Y,u)  +  D(a)u(t) 


C(a  )m°z  (t ;  y  ,u)  +  D(a)uCt) 


while 
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yN(t;  YN)  =  C(oV°zN(t;YN,u)  +  D(aN)u(t). 

The  claimed  result  follows  at  once  from  the  result  of  Theorem  4.1 

|*°zN(t;YN,u)  -  *°p Nz(t;y,u)|  n  -  0, 

R 

and  (4.4)  which  yields 

|*°PNz(t;y,u)  -  n°z(t;Y,u)|  n  +  0. 

N  R 

N 

We  observe  that  in  (4.1)  we  define  z  for  initial  data 
in  Xjj.  However,  one  can  define  an  analogue  for  initial  data  given 
in  y.  In  particular  for  fixed  y  ~  (zQ,q)  €  T  =  y  *  Q  we  define 

(4.5)  zN(t;Y,u)  =  SN(t)PNz0  +  j^SN(t-c)PNB(a)uCa)da 

where  =  AN(q),  A  =  A(q)  are  i.g.'s  for  SN,S.  If  one  then 
assumes  that  AN(q),A(q)  satisfy  the  hypotheses  of  Theorem  3.1 
so  that  |PNS(t)z  -  S^(t)Pjsjz|N  -*■  0,  one  can  prove  in  almost  exactly 
the  same  manner  as  that  for  Theorem  4.1  above  that 

|zN(t;y,u)  -  PNz(t;y,u))N  -*  0 

for  Y  €  T.  Defining  yN(t;y)  as  in  (4.2)  except  with  zN(t;y,u) 

N  N 

of  (4.  5)  in  place  of  z  (t;y  ,u),  we  have  under  hypothesis  (4.4) 
the  analogue  of  the  results  of  Corollary  4.1: 
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(4.6) 


y  (t;Y)  +  yCt;r) 


for  each  fixed  Y  €  r. 

Theorem  4.2.  Suppose  { Y^)  is  a  sequence  of  solutions  of  the 

approximate  problems  (  under  Assumption  4.1.  Then  there 

-Nk  -Nk 

exist  Y  €  T  and  a  subsequence  (y  }  such  that  Y  Y 

— N  — 

in  the  sense  of  Theorem  4.1(a),  (b).  If  A^(q  ) ,A(q)  satisfy  the 
hypotheses  of  Theorem  3.1,  then  Y  is  a  solution  for  the  problem 
(<?). 


Proof:  First  we  have  YN  «  (Zg,qN)  £  *  Q.  Defining 

we  have  z”N  €  $/  ,  5/  compact,  so  that  there  exists 

-Nk 

a  convergent  subsequence,  say  (z  },  converging  to  some  zn 
•t  _Nk  - 

in  5/\  i.e.,  zQ  ■*  zg  in  Sf.  From  the  compactness  of  Q, 

k 

-Nk 

we  have  that  {q  }  possesses  a  convergent  subsequence  with 

_  .  _  _  _ 
q  3  -*•  q  for  some  q  €  Q.  Defining  Y  =  (zn,q)  £  V  -  Sf  *  Q, 

_N. 

and  reindexing  we  thus  have  a  sequence  (Y  that  converges  in 
the  sense  of  Theorem  4.1(a),(b)  to  y.  Furthermore,  it  follows  from 
Theorem  4.1,  Corollary  4.1  and  the  remarks  involving  (4.5)  and 
(4.6)  that  for  any  y  *  (z_,q)  £  r  we  have 


N.  N.  N.  _N 

J(Y)  =  lim  J  ^  (y  •*)  (Corollary  4.1  yields  y  J(t;Y  J)  y(t;r)). 


But  we  find 
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N.  _N.  N . 

lim  J  3  (Y  3)  5  lim  J  J ( CPM  zn>q)) 
N.-^o  N.-x»  "j  u 

J  J  J 


=  lim 


M  N. 

I  |y  3  Ct:-  ;  CPN  zQ,q)) 
i=l  1  u 


N. 

But  y  -*(t^;(Pjj  Zg,q))  given  by  (4.1)  is  exactly  the  same  as 
~N-  J  „N 

y  Y  =  (zQ,q)  ,  where  y  is  defined  as  in  (4.5),  (4.6) 

and  hence  the  latter  term  is  the  same  as 


M  N. 

lim  l  | y  3 (t Y)  -  y.  | 
N.-*»  [i=l  1  1 


J(Y). 


Thus,  Y  is  a  solution  for  ( 0>) . 


We  turn  next  to  a  discussion  of  particular  schemes  which  fit 
into  the  theoretical  framework  developed  above.  Throughout  our 
presentation  we  shall  assume  that  we  are  given  a  sequence 

YN  =  (nN,4>N,qN)  When  qN  =  (<*N,hN)  =  (aN,rJ . rj)  £  Q,  with 

0  <  r^  <  r^  <  . . .  <  r^  <  r,  and  q^  -*■  q  =  (»,h)  =  (a ,  r^ , .  .  .  ,  rv)  6  Q. 
We  recall  that  for  the  systems  under  discussion  we  have  the  operator 
defined  on  =  (  (<t>  (0)  ,<j>)  |  $  6  W^^C-rjO)}  given  by 


J*(q)  (♦«>),♦)  -  (L(q)4>,D*) 


where  the  operator  L  is  defined  in  (2.4).  Hereafter  we  shall  use 

the  notation  D<J>  in  place  of  <j>  in  context  where  confusion  might 
arise  otherwise. 
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We  summarize  for  future  reference  the  conditions  that  our 
approximating  schemes  must  satisfy: 

(4.7)  X^  is  a  closed  subspace  of  Z^  =  Rn  x  L^C-r^O),  is 

the  canonical  projection  of  onto  X^,  PN  =  ^ 

and  tt^(P^z)  ■+  for  all  z  €  Z. 

N 

(4.8)  There  exist  constants  M  and  6  such  that  _Q^  =  _Q?^(q  ) 

and  ja^(q)  are  in  G(M,8)  on  X^  and  Z  respectively. 

(4.9)  There  exists  ^  <=  3>  »  (q) ) ,  dense  in  Z, 

such  that 

(i)  Rx(J^(q))  c  %  for  X  >  8 

(ii)  l-^NpNz  -  Pn^In  0  as  N  »  for  z  € 

In  our  discussions  below  we  shall  refer  to  (4.8)  as  the 
stability  condition  while  (4.9)  will  be  called  the  consistency 
condition.  Our  first  scheme  will  be  based  on  the  averaging 
approximation  developed  in  some  detail  in  [  5  ]  while  the  second 
scheme  utilizes  spline  approximations  as  formulated  in  [10]. 
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5 .  The  averaging  approximation  scheme 

This  identification  scheme  is  defined  using  the  "averaging" 
type  approximations  as  discussed  in  [  4  ] ,  [  5 ]  and  many  of  the 
arguments  to  verify  that  conditions  (4.7),  (4.8),  (4.9)  are 
satisfied  are  only  slight  modifications  of  those  found  in  [5], 

into  sub- 


N  ,  N  N  N  N 

Given  q  =  (<*  tr1,...,rv),  we  partition  [-rv,0J 

intervals  [t^.t^^]  where  t^  e  -jr^/N,  j  =  0,1 . N.  Let 

XN  denote  the  characteristic  function  of  [t^,t^  , )  for 


j  =  2,3, 
N 


y  j-i- 

N,  with  the  characteristic  function  for  [t^,tg] 


[-rv/N,0].  Define  for  ( n , 4> )  £  Z 


,N  =  JL  f  ' 

j  ‘  rN  J  N 

L  \\  4-  1 


'll 


(s) ds , 


j  =  1 , 2 , . . . ,N, 


(5.1) 


♦S  5  "• 


We  define  the  closed  subspaces  X^  of  Z^  by 


xN  =  {(n,*)  £  zN|  n  e  Rn,  =  .^vjx^vj  e  Rn} 


The  projection  of  ZM  onto  XM  is  then  given  by 


‘N 


N 


(5.2) 


%(n,4>)  =  Cn,  l  A?) 

N  j=l  J  J 


With  these  definitions  it  is  immediately  obvious  that  (4.7)  is 
satisfied. 

For  the  operator  L  given  by  (2.4),  we  define  the  approximating 


operator  bN(qN):  xN  -*■  Rn  by 
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M  N  N 

(5.3)  LNCqN)Cn,  l  VjX j )  = 


A0(aN)n  +  l  l  AiCaIN)v.X^  C-r") 
i  =  l  j  =  1 


N 


+  l  k^C“n)v. 
j-1  3  3 


where 


N 


(5.4) 


K^(a)  5  [  K(a,6)dQ. 

J  J  N 


tj 

Next,  we  define  D^:  -*■  L^C-r^.O)  by 


(5.5) 


DN(n,  StjxJ) 

x  J  x  1  v 


N 

where  vQ  =  0.  Finally,  we  define  _Q^ (q  ) :  -*•  by 


:  (5.6)  -^N(qN)(H,t)  =  (LN(qN)(n,*),  DN(n,*)). 

j  The  proof  that  ja^N(qN)  €  G(M,8)  on  XN  for  some  M  and  3 
'  independent  of  N  is  essentially  given  in  [  5 ]  (see  p.  183  and 

p.  186).  One  first  argues  that  there  is  an  equivalent  inner 

product  <  ,  )N  on  XN  such  that  (srf  (qN) z , z )  N  <  8(qN)<z,z>  N 

g  g  g 

N  N  N 

for  all  z  e  X^.  As  in  [  5  ]  »  we  define,  for  given  r •  •  •  ,rv, 

N  N  N  N 

the  index  set  J  ==  ,}  where  j.  is  the  index  such 

1  v- 1  1 

that  -r^  e  [t^N,t^N  ),  i  =  l,2,...,v-l,  and  =  N.  We  next 

h  h'1 

N  N 

define  numbers  a^  by  aN  =  1  and,  for  j  =  N-l.N-2,. . .  ,1, 
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U  * 1 

if 

•  ^  tN 

J  €  J 

N 

j+l 

if 

•  *  tN 

J  l  J  • 

aN  = 
1 


Then  define  the  nondecreasing  piecewise  constant  weighting  function 
g  by  g  C9)  =  ,  t.  <  0  <  tj  2,  j  =  1,2,. .  .  ,N.  Finally,  we 

take  ZNCgN)  and  XN(gN)  as  the  spaces  ZN>XN  respectively  with 
equivalent  topology  generated  by  the  inner  product 

e  0  kt 


C5.7) 


|,<P)  ,(«,¥)>  N  s  <n,c>  +  [  N  WgN. 

g  R  J-r,. 


r  N 

If  we  then  consider  (n,<<0  =  (n,  l  v-x)  €  XM  (and  define  vr 

j=l  J  J  n  1 

we  find  in  a  straightforward  manner  using  estimates  similar  to 
those  in  [  5,  p.  186]  that 

v 

I  (5.8)  <^N(qN)(n»,(n^))N<  i\A0i°n)\  +7inAi(«N)l2}|n|2 

+  7  I  |v.  |2  +  I  |«M)||v  1||n|  -  I  <v  rv  v  Aa” 

L  i=l  h  j=l  3  3  j-1  3  3  3  3 


Noting  that  for  <P  =  I  vjXj  * 


l(n,¥)| 


XI 

,  N  r,.  , 

2  .  c  v  i  ..  i  2 


i  ■  i"i  *  i  if 

j  =  i  J 


we  find 


42 


k  n  f  r7Vj 

l  |k”on)I  |Vjl  hi  ■  l  N  J  N 

j=l  3  •'  3-1  rv  >  tj 


I  If  4f  1  f  3  1K(«N,9)d9  hi  IT 


(,  \  1/2  t?  , 

y  1  .  .2  N  [  KC°^ 

5  JA  7  |n|  7  In 

J-l  l  rv  J  tj 

. 


1  rv  .  1 2 


K(cxN,0)d0  +2"rlVj! 


3=1  I  rv  j  j 


i  1|vfrtN  B  'i  t  2  H0  +  i  -sr-  1  v  -  |  2 


N  r 


tU  N  2  1  V  V  ,  . 

|K(«N,e)|2d«  ♦  j  .J  TT  lvjl 
N  3=1 


2  N  1  j 


■i  +  i  f°  |K(aN,0)]2d8 
2  2  '  N 


Next  observe  that 


N  N  r  r  3-  l  I 

i  <vi-i‘vj’vj>aj  *  -V?  v3-1 
j=l  3  A  3  3  j  J=1 


2  +  i  Iv,  I2  -  |v4|2W? 


7  ,v3 


3*  3 


*  Ul^-Xl2  • 

3=1 

=  ^  |n|2a^  +  ^  +  i’aj  •* !  vj  I  2  "  7  I  VN I  2aN 

=  \  v| n| 2  -  i  l  lvi  I2- 
2  L  i=l  Ji 
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Using  these  estimates  in  (5.8)  we  finally  obtain 

<-^N(qNXn^),(n,'l»))  N  <  8(qN)  |  (n>)  1^ 

g 

where 

0(qN)  =  |A0(aN)|  +  \  l  1a  (0tN)  1 2  +  1  f°  |K(aN,6)|2de  +  ^  . 

u  i=l  1  L  J  N  L 

*rv 

From  the  continuity  assumptions  made  in  §2  (see  (2.4))  and  the 
fact  that  q  €  Q,  Q  compact,  we  have  existence  of  8  such  that 

8(qN)  £  8  for  all  N.  Finally,  since  the  and  XN(gN)  norms 

are  equivalent  independent  of  N,  one  finds  (again  see  p.  186  of 
[  5])  fl^(qN)  £  G(M,B)  on  XN.  Since  fl^(q)  is  the  i.g.  for  a 
CQ-semigroup  it  also  satisfies  the  requirement  ja^(q)  €  G(M1,81) 
on  Z  for  some  and  Bj.  It  follows  that  the  stability 

condition  (4.8)  is  satisfied  for  our  averaging  approximations. 

We  next  consider  the  consistency  criteria  (4.9).  We  take 
=  { (<J>  (0)  ,<j> )  | <J>  is  C1  on  [-r,0]>.  Then  clearly  is 

dense  in  Z  and  <=■  iS?(_Q^(q)).  Furthermore,  R^(A(q))  ^  c: 

(q))  c  (see  §2  above)  so  that  (4.9)- (i)  is  satisfied 

for  this  choice  of 

It  remains  to  establish  (4.9)-(ii).  Given  z  =  (^  ( 0 )  ,4> ) 
in  we  observe  that 

(5.9)  PNJ3^(q)z  =  (L(q)$, 


where 
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CW) 


N 


_  N  f  J 
=  7  J  N 


D*(s)ds  = 


4  wcty.j)  -  +ctj)i 

rv 


while 


(S.10)  J^N(qN)PNz  =  -^NCqN)(*o,Ji*^Xj)  =  dN  (qN)  PNz  .D^z) 


N 

where  LN(q  )  and  DN  are  given  by  (5.3)  and  (5.5).  In  view  of 
(5.9),  (5.10)  and  (1.4),  it  thus  suffices  to  show 


(5.11)  A„(aN)*(0)  +  I  I  A.(aN)^xf(-rf)  +  I 


i-1  j-1  1 


n  i' 


j-i 


— A  (a)*(0)  +  l  A.  («)*(- r.)  + 
Rn  u  i=l  1  1 


f0 


K(a,0)*(0)d8 


*  r. 


and 


CS.12)  f°  I  l4  (♦*-♦»)  -  (»)"]x» 
J.N  j-1  r”  3  1  3  J  3 

*  %  k 


N  — 

as  N  -*•  «  (and  r^  -*•  r^). 

Consider  (5.12)  first  and  write  this  integral  as 

N  ,  2 


f 

'  N 
'rv 
“FT 


4  (*S-*i-[*(0)-*(-^)l) 

rv 


*  i  I 

j-2  )tN 


N  ,*N  .  N  r .  ,  N  ,  .N,,. 

H  C^.1-*j-[*(tj.1)-  (t.)]) 

rv 


TN  mN 
A1  1 2  * 
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Using  an  analogue  of  (3.18),  p.  177  of  [  5 ]  with  r  replaced  by 
N 

rv,  estimates  similar  to  those  of  [  5]  yield 


TN  <  rN 
1 2  -  rv 


sup 

1<  j  <N  3 


where,  as  in  [  5  J ,  we  define 


=  sup{  |<i>(0) -♦  (s)  |  s,  9  £  [tj.tj^]} 


N 


Use  of  the  analogue  of  (3.18)  of  [  5 ]  in  allows  us  to  write 

(after  arguing  in  much  the  same  manner  as  done  on  p.  177  of  [5]) 

rN  r  -rN  V 

T1  -  IT  1  *1*  ' 

Since  -*■  0  as  N  -*■  «,  uniformly  in  j  ,  we  conclude  that  (5.11) 


obtains . 


We  remark  that  if  <i>(0)  =  0  and  <j>  €  W^2-*  (-r ,0) ,  then 


•  -r" 

♦C  -?r) 


-r. 


.N 


*(— )  -  no) 


<  Sup  |  4>  (6  )  |  -jq-  , 


rN 


M  L1' 

and,  since  is  0(-^)  -  see  p.  178  of  [  5  ]  ,  we  find  that  the 

convergence  in  (5.12)  is  O(-i*0  or  that  the  convergence  in  the 

N  1 

second  component  (L^  component)  of  (4.9)- (ii)  is  O(^). 

Returning  to  (5.11)  and  recalling  that  A^o^)  A^cT),  we 
see  that  to  establish  C5.ll),  we  only  need  show 
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(5.13) 


l  ♦  Mc.-r?)  -*•  <K-r.),  i  =  1,2,. ..,v, 
j  =  1  J  J 


(5.14) 


I  K?(«N)*?  -  f°  K(ot ,  o)<J>  (a)do. 

j«l  3  3  J.? 

v 

For  <J>  in  C1  on  [-r,0]  we  have,  for  9  £  [tjjt^  ^), 


tN  , 

’  J-l 


(5.15)  |4>^-4>(0)|  =  4  [  3  A[<Ko)-«K9)]da  <  N  f  3  |a-9|do 

3  r!4.  3  N  '  r„  )  N 


v  t 


5  4  l^looC^)2  =  lil.  IT 

r« 


v  t 


From  this,  it  follows  immediately  that 


I  (5.16) 


i_  .N  N  a  |  2  _  r\i\  ..Z,irv.2 

|E  N  5  (1*1.)  C-nO  • 

3  3  L2(-r;,0)  N 


4  =  4* 


N  .  N 


For  *  j ^  chosen  so  that  -r^  €  [t^  ,t^  _^) 


we  find 


using  (5.15) 


|I  ♦jXj(-r5')-«K-r.)|  <  -♦(-rj)|  +  ] M-r?) -♦  (-r.)  | 

rN  1 

5  l*L  ir  +  I* I Jrrril 

and  thus  the  convergence  in  (5.13)  is  insured  by  the  convergence 

N  —  1  N  — 

ri  ri»  the  order  given  by  if  r^  -*■  r^  is  of  this  order. 
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Finally,  in  considering  (5.14)  we  note  that 

t-_ 

i  =  l  f  l  W^a^dS 

j  =  l  J  J  j=lJ  tj  J 

=  f°  K(aN,0)  ?  ♦  W(0)d0 

J  N  j=l  3  3 


and  hence 


=  I  K^(aN)<j>^  -  f  K(u,o)0(o)dc 
j=l  J  J  J  _ 

‘rv 

-  [  K(aN,o)[E  4>M(o)  -  <?>(°)jdo 
J  N  1  J 


We  thus  find 


,  N , 


,0 

1/2 

’  o 

< 

f  |K(aN,o)|2do 

lr  *N  N  A  ,  2 

1  N 

J  N  J  J 

'rv 

’rv 

r°  n  r° 

K(a  ,o)<j>(a)do  -  K(a  ,o)<|>  (o)da. 

N  >  - 

-r„  -rs, 


1/2 


f0  K(aN, 

J  N 
■r.. 


a)4>  (a)  da 


[°  - 

J  K(a, 


0)0 (a) do 


-r. 


The  first  term  is  0(j|j)  by  (5.16)  while  standard  estimates  on  the 

M  _ 

second  term  yield  that  it  -*•  0  since  K(»  ,•)  -*■  K(a,.)  in  L 2 
N  — 

and  rv  ry.  The  order  of  convergence  of  the  second  term  depends 

M  _  1 

on  that  of  these  latter  two.  If  |ry-r  |  =  O(^)  and  if  the 
convergence  K(<*N,‘)  ■*  K(a,’)  in  is  0(jjp  ,  then  is 
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OCjjj)  also. 

In  summary,  we  have  established  (.4.7),  (4.8)  and  (4.9)  for 
the  averaging  approximation  I.D.  scheme.  In  doing  so  we  have  also 
shown  that  under  certain  circumstances,  the  convergence  in  (4.9)- (ii) 


is 

°tH> 

• 

In  particular, 

.  r  .  N 

if  in  q  -*■ 

q  we  have 

(5. 

17) 

(a) 

The 

N  — 

convergence  A^(ot  )  -►  A^(ot) 

is  0( 

I) 

NJ  » 

(b) 

The 

N 

convergence  K(a  ,*) 

+  K(a,-) 

in 

^(-r^O) 

is 

0(i>, 

(c) 

N 

ri  ' 

♦  r .  is 

‘  - 1. 

» 2  , . . .  ,  v, 

(d) 

z  = 

(♦  (0  )  ,<P  ) 

,  0  e  C*  on 

[-r,0] , 

♦  (0) 

=  o, 

then  I  ^N(qN)PN(*  (0),*)  -  PNJ^(q)  (♦  (0)  ,♦  )|  N  is  0(±). 

Remark  5.1.  We  remark  that  the  conditions  (5.17a-c)  clearly  are 

not  conditions  that  can  be  verified  a  priori  when  using  the 

averaging  scheme  in  practice.  These  error  estimates  merely  provide 

information  as  to  how  well  the  scheme  might  perform  when  applied  to 

specific  I.D.  problems.  Note  that  the  particular  method  (maximum 

likelihood  estimator,  least  squares,  etc.)  chosen  for  determining 
N 

q  will  obviously  affect  the  rates  of  convergence  in  (a)-(c)  above. 
Finally,  if  one  drops  the  condition  <P  (0)  =  0  form  (5.17d)  but 
retains  all  other  conditions  in  (5.17),  one  finds  the  order  in 
(4.9)-(ii)  is  only  1//R. 
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Remark  5,2,  Recalling  the  order  estimates  on  C3.2)  given  in 
Theorem  3.2,  we  observe  that  one  can  easily  find  sets  &  and 
^  to  satisfy  the  hypothesis  of  that  theorem  in  case  of  the  averaging 
based  scheme.  For  example,  to  insure  convergence  of  order  1//N, 

2  —  7  _ 

one  can  choose  the  sets  .0  =  (q))  and  ^  (q)) 

-  {(HO),*)  |  *  e  w£3)(-r,0),  HO)  =  LO),  HO)  =  l($)}.  Then 
one  can,  under  the  assumptions  (5.17a-c),  without  difficulty  argue 
the  claimed  order  results. 
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6 .  Spline  based  approximation  schemes 

We  discuss  in  this  section  an  identification  scheme  based  on 
spline  approximations.  While  we  shall  present  the  details  for  a 
scheme  based  on  first  order  splines,  arbitrary  order  spline 
approximations  may  be  utilized  in  a  similar  manner  with  only  slight 
modifications  in  the  arguments  indicated  below  (see  the  theory 

developed  in  [10] ,  on  which  all  of  our  discussions  here  are  based). 

N  NN  N  —  —  —  — 

Given  q  =  (a  ,rlf...,rv)  -*•  q  =  (a  ,r1 , .  .  .  ,rv)  as  we  have 

hypothesized  previously,  we  partition  each  of  the  subintervals 
N  N 

E-r^, -rjf_j] ,  k  =  l,2,...,v,  into  N  equal  subintervals  to  define 
the  partition  {tj)Vj_j  of  [-r^,0]  with 

(.6.1)  t J  =  -  (j-Ck-D^frJJ-rJJ.^/N  +  rJJ.j, 


j  =  (k-l)N, . . . ,kN,  k  =  l,2,...,v.  We  then  define  the  finite 
dimensional  subspace  XN  c  by 


=  (  0>  (0)  ,<P)  |  <J>  is  a  first  order  spline  with  knots  at  { t ^ } 


N 

We  define  the  weighting  function  g  by 


gV)  ■  < 


1 

2 

v-1 

v 


N  .  .  N 

tv  5  9  <  -rv-i* 

-rN  <  6  <  rN 

rv-i  -  rv- 2  ’ 


N  q  N 

-r2  <  9  <  Tj 

■rj  <  9  <  0, 
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and,  as  in  §5,  denote  by  ZN(gN)  and  XN(gN)  the  spaces 

Zj^j  and  endowed  with  the  equivalent  topology  generated  by  the 

weighted  inner  product  (5.7).  We  then  define  -*• 

N  N 

(equivalently  Tt^:  Z^(g  )  -*■  X^(g  ))  as  the  orthogonal  projection 
of  Z^(gN)  onto  XN(gN) .  Then  (see  [10  ,  p .  509])  for  in  Z^, 

am 

we  have  where  <l>  is  the  solution  of  the  problem  of 

minimizing  |$-'|<|  N  over  $  €  X^.  The  operator  PN:  Z  -*■  X^ 

zN(g  ) 

is  defined  as  before  by 

We  adopt  the  following  notation.  For  any  function  <}>  that  is 

N  A 

defined  pointwise  on  [-rv,0],  we  write  <P  =  (4>(0),4>)  and 
am  N  N  N 

♦  l  =  (4>j(0),4>1)  where  <f>  j  is  the  interpolating  spline  (with  knots 

at  {t^}VN  )  for  4>  on  [-rfLo].  For  the  projections  n 
3  j=l  v  N 

defined  above  we  shall  write  =  ^((j)  (0)  =  (d> N  (0)  ,$N)  . 

For  any  qN  €  Q,  we  define  the  operator 
stt qN)=  ^(j*T(qN))  c  ZN  -  ZN,  where  0(j^(qN)) 

=  {  (♦  CO)  ,4»)  e  ZN|*  e  Wp^C-rJf.O)}  ,  by 

a^(qNl4>  (0)  ,♦)  =  (L  (qN)<J>  >  D4> ) . 

More  generally,  for  any  q  6  Q,  we  can  define  q) :  e  Z  -*■  Z 

by 

ja^(q)  (<J>  C0)  ,♦)  =  (L(q)+  , D4> }  . 

Note  that  in  this  latter  case  D4»  is  defined  on  [-r,0],  while 

N 

in  the  former  D$  is  defined  on  [-r  ,0].  However,  in  both 
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cases  the  operators  are  essentially  the  same  and  in  the  discussions 
below  we  shall  abuse  notation  and  speak  of  srf  as  an  operator 
defined  either  in  or  Z,  depending  on  the  context.  We  note 

that  XN  c  i^(ja^(qN))  so  that  ja^(qN)  is  defined  on  all  of  XN> 

With  the  definitions  above,  we  have  immediately  from  Lemma  2,3 
of  [10  ]  that  ^CqN)  satisfies 


(6.2)  (j^CqN)z,z)  N  <  ^(qN)|z|2N,  z  €  ^(j^(qN)), 

g  g 


where 


O) 


(qN)  =  ^  +  |AQ(aN)|  +  \  l  |A  (aN)|2  +  I  f°  [  K(aN,6 )  |  2d«  . 
U  1  i=l  1  1  J  N 


N, 


We  next  define  -0^N(q  )  :  XN  -*•  XN  by 


i  (6-3^  ^N(qN)  =  *NJ^(qN)V 

i 

I 

In  view  of  (6.2)  and  the  fact  that  tt^x  =  x  for  any  x  €  X^,  we  find 
for  every  x  e  XN 


^N(qN)x,x>  N  =  <nNja^(qN)x,x) 
g 

=  (ja^(qN)x,x)  <  u)(qN)  |  x  |  2N> 

g 

As  noted  in  §5,  there  exists  3  such  that  w(qN)  5  3  for 

all  N  and  thus  fl^(q^)  €  G(M,3)  on  X^(gN)  and  hence 
N 

ja^jCq  )  e  G(M,e)  on  X^  for  some  M  independent  of  N.  Similar 
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arguments  establish  that  _Q^(q)  €  G(M,6)  on  Z  for  M,g 
appropriately  chosen  and  it  thus  follows  that  condition  (4.8)  is 
satisfied  by  the  approximations  (6.3). 


We  turn  next  to  the  consistency  condition  (4.9)  and  define 
^  =  5^Cq^( q] ) .  This  set  is  dense  in  Z  and  it  follows  at 
once  that  R^(_a^(q))  ^  <=  ^  so  that  (4.9)- (i)  is  satisfied. 

For  z  =  (4>  (0)  ,4>)  €  ^  we  have 

^N(qN)PNZ  =  \(L(qN)4>N,D0N) 

where  $N  =  PN(4>  (0)  ,<J> )  =  ^ (0)  ,<}>)  »  while 

PNJ*Tq)z  =  -^N(L(q)4>  ,D4>)  . 

It  thus  follows  that  for  z  €  ^ 

/ 

j  (6.4)  |J^NPNZ  -  Pn-Q^z|n  -  hN(L(qN)<j>N  -  L(q)q>,  D(4>N-<I>  )  )  I  N 

N 

where  we  now  understand  that  D(4>  -<t>)  is  to  be  taken  as  a  function 
N 

on  [-rv,0].  Recalling  that  is  the  orthogonal  projection  of 

N  N  N 

ZN(g  )  onto  XN(g  )  and  that  the  norms  of  ZN(g  )  and  ZN  are 

equivalent  (with  constants  independent  of  N),  i.e. 

|V|  c  |  z  |  jsj  <  rn|  z|N,  we  see  that  (6.4)  allows  us  to  establish 
g  g 

condition  (4.9)-(ii)  by  verifying 
(6.5)  |  D(d>N-d>)  J  -►  0  as  N  • 


and 


(6.6) 


L(qN)4>N  -  L(q)<{>|  +  0  as  N  ■*■ 

Rn 


N 

where  in  (6.5)  the  norm  can  be  that  of  L^f-r^O)  or  L2  with  the 
weighting  function  (see  (5.7))  g^. 

To  show  (6.5)  and  (6.6)  we  shall  make  use  of  standard 
estimates  from  the  theory  of  spline  approximations.  Specifically 
from  Theorem  2.5  of  [  24  ]  we  find,  upon  considering  the  interval 
[tkN,t(k-l)N^ ’  k  =  which  has  mesh  size  h  =  (rk'rk-i^/N» 


tN 

f(k_l)N  xt  2  , 

(6.7) 

J  N  1  IT 


kN 


N  N 

Wi 

N 


'  tN 

f  dc-i)N  2  2 

|D  <M2 

J  N 


kN 


and 


tN 

f(k-l)N  X.  2  x 

(6.8)  5  IX 

J  N  1  rr4 


kN 


rN-rN 
rk  rk-l 

N 


t(k-i)N  2  2 

IdV. 


r 

J  N 


IcN 


N  2 

Here  j  is  the  interpolating  spline  for  <P  €  C  [-r,0]  with 

knots  at  {t^}.  Denoting  by  I’l  2  N  tke  norm  ^(-r^.O),  we 

deduce  from  (6.7)  and  (6.8)  the  estimates 


Id«-<)I2jN  ;  4  Uax|r»-r».i|}I  |D2*|2>N 


(6.9) 


C6.10) 


1 j  1  2  ,  n  5  ^{maxjr 


N  N  i  1 2  1 
y  i  1 4  r  i  _  i  I  j 

j  J  J  x 


5* 


d2<j> 


2  ,  N  ‘ 


i  i  N 

Denoting  by  | * |  the  weighted  norm  in  L7(-r,0),  we  easily 

2,11,/  2  v 

argue  for  4>  =  C4>(0),4>)  6  ^  (using  the  minimality  properties 

associated  with  tt^) 


N 

g 


5  u;-ii  N 

g 


2  »N , gN  1  2  , N 


From  (6.10)  we  observe  that  this  last  quantity  is  0(^0  and  hence 

so  is  |  "j/  $|N.  ^  follows  immediately  that  |<f>N(0) -4>  (0)  |  n 

1  ^ 
is  0 ( — y)  also. 

hr 

We  remark  that  we  have  shown  that  I7Tj/'iMn  "*■  0  whenever 
$  €  The  density  of  ^  in  Z  and  the  boundedness  of 

{"N}  thus  imply  this  convergence  (|"Nz-z|N  -*•  0)  for  all  z  €  Z 
and  the  condition  ff°(PNz)  -*•  tt°z  for  z  £  Z  of  (4.7)  is  satisfied. 
We  next  consider  the  inequality 


(6.11)  |D(4>N-*)I2,n  5  |  D(4>N-<)>  j  )  t  2  ,  n  +  lD^I-*)l2,N 

and  observe  that  the  second  term  is  O(^)  by  (6.9).  We  employ 
the  Schmidt  inequality  [  24  ]  to  estimate  the  first  term.  Since 
both  <j>N  and  <f>j  are  linear  on  each  subinterval  [t^.tj^J  we  have 
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(6.12) 


N 


|D($N-*J)  I  2 


j= 1  JtN  1 


N  A,2 


-  r 


<  T 

j=i  jtN  1 


V 

-  l 


3T 


k=!  tCr^r^/M]2  '_r 


f  k‘1|0N-<l 

J  N  1 


v 

<  I 


3T 


k-1  [(rjJ-rJJ.^/N]- 


CJ.N  ,  rpN 

T1  V 


Using  (6.8)  we  obtain  the  estimate  for  T™ 


TN2  <  l  4  1 

Z  k=l  TT4 


3T 


N  N 

Vrk-1 


N 


N 

I  ^l»2*|2  i^(5)2lnl*l2.H' 

-r,. 


N 

To  obtain  the  desired  estimate  on  we  need  an  additional 

assumption  on  q  =  (a, , . . . ,rv) .  Specifically  we  assume: 

(6.13)  There  exist  <5  >  0  such  that  >  <5 »  k  =  1,2,...,> 


With  the  assumption  we  find  (for  N  sufficiently  large) 
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But  our  arguments  above  revealed  that  is  0(^=-)  and 

L » 

thus  T^,  like  T^,  is  0(^y).  It  follows  from  (6.12)  that  the 
1  L  fT 

first  term  in  (6.11)  is  O(j^).  We  have  thus,  under  the  additional 
assumption  (6.13),  established  (6.5). 

N 

Finally,  we  observe  that,  for  -rv  <  9  <  0, 


<J>N(9)  -  <j>N(0)  +  [  D<!>N 
J  0 
(0 

<K6)  =  ?(0)  +  M 


and  thus 


|<})N(0)-$  (9)1  <  |  <t>N(0)  -<i>  (0)  I  +  [  |Dq>N-D4>| 

J  _N 


<  |<^N(0)-4i(0)|  +  /F|D($N-$) 


But  these  last  two  terms  are  O(j^),  uniformly  in  9 .  it  follows 
that  |<},N(-r^)  -  <j>(-r^)|  is  O(jjj).  Since  <p  is  continuous  and 
qN  -*•  q  we  find  that  L(q^)<f>^  -►  L(q)<f>  and  thus  (6.6)  obtains. 

Summarizing,  we  have  shown  that  (4.7),  (4.8),  (4.9)  (where 

N  N 

is  now  the  projection  of  Z^(g  ^  onto  XN(g  ))  hold  for  the 
first  order  spline  based  scheme  defined  by  the  operators  in  (6.3) 
under  the  assumption  (6.13).  Furthermore,  if  one  inspects  carefully 
the  estimates  given  above,  one  finds  that  under  the  hypothesis 
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■  (5.1 7a-c) ,  the  convergence  in  (4.9) Cii)  is  O(i)  . 

Remark  6.1.  In  the  above  estimates  we  had  chosen  ~  (q)) 

so  that  in  (6.4)  and  the  subsequent  arguments  was  in 
wJj^(-r,0).  To  apply  the  needed  estimates  (e.g.,  Theorem  2.5  of 
f  24])  and  make  the  arguments  above,  it  is  actually  sufficient  to 
have  in  W^^(-r,0)  (see  Theorem  21  of  [  2S  ]).  We  thus 
could  have  just  as  easily  chosen  3)^  =  (q))  and  arrived 

at  the  conclusions  above,  including  the  convergence  rates  obtained. 

Remark  6.2.  In  light  of  the  above  remark,  we  may,  in  order  to 
obtain  that  the  approximating  semigroups  converge  like  O(^)  for 
the  scheme  developed  here,  choose  &  -  (q))  and 

=  SK-Q^3(q))  in  Theorem  3.2.  Under  assumptions  (6.13)  and 

1  (5.17a-c),  one  then  can  readily  verify  that  the  hypotheses  of 

I  Theorem  3,2  are  satisfied  by  the  spline- based  approximations. 

i 

i 

i 


»-..<*  ****■.  'V'- 


.  Vi  *w  *  -‘ 


7.  Numerical  results 

In  this  section  we  present  a  brief  summary  of  some  numerical 
results  for  the  identification  problem  obtained  using  the  approx¬ 
imation  schemes  (AVE  and  SPLINE)  outlined  in  the  two  previous 
sections.  For  a  more  detailed  discussion  of  the  numerical  perfor¬ 
mance  of  the  AVE  and  SPLINE  schemes  in  identification  and  control 
problems,  the  reader  can  consult  [8]  where  numerous  examples, 
error  analyses,  etc.  are  presented.  The  summary  given  here,  "aken 
with  the  extensive  numerical  tests  reported  in  [8] ,  support  our 
claims  of  efficacy  and  practical  usefulness  for  these  methods. 

In  order  to  generate  the  data  for  testing  the  algorithms  we 
select  a  "true"  set  of  parameters  y*  =  (n*  ,$*  ,a*  ,r*)  (we  take  v  =  1 
and  r ^  *  r)  and  a  control  u  and  use  the  method  of  steps  [14]  to 
solve  for  x  on  the  interval  [0,T].  In  all  of  the  examples  presented 
I  below  "data"  was  generated  using  r*  =  1  and  u  =  u^ ,  where  u^  is 
j  the  unit  step  at  t  =  1  defined  by 


ut(t)  - 


t  <  SL , 


a  <  t, 


and  0  <  l  <  1.  The  final  time  of  T  *  2  was  used.  The  observations 
y^  =  y(t^)  were  generated  at  101  equally  spaced  time  steps  on 
[0,T].  It  is  possible  to  add  noise  to  the  "data"  to  produce 
"noisy  observations"  y(t)  *  y(t)  +  v(t),  where,  for  example, 
v(t)  =  col(vj(t),  ...,  v^Ct))  is  a  computer- simulated  vector  of 
normal  random  variables  v^(t),  each  with  zero  mean  and  preset 


«4-, 
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standard  variation.  This  was  done  for  some  of  the  examples  in  [8] 
(we  do  not  do  it  here)  where  one  again  finds  that  the  algorithms 
perform  quite  well. 

For  each  fixed  N,  the  approximation  problem  (  was  solved 

using  a  maximum  likelihood  estimator  (MLE) .  The  resulting  solutions 

are  denoted  y^  and  Yg  for  the  AVE  and  SPLINE  schemes  respectively. 

Since  the  MLE  is  an  iterative  procedure  it  is  necessary  to  supply 

a  startup  value  (i.e.  an  initial  guess)  for  the  parameters  y^  or 
N 

Yg.  If  6  denotes  an  unknown  parameter  to  be  estimated  (e.g.  B  =  a 

N  I  -  N 

or  B  =  r) ,  then  B  *  will  denote  the  estimate  for  B  obtained  after 

I  iterations  of  the  MLE  applied  to  problem  ( j^)  .  The  startup 

N  0 

value  is  denoted  by  B  ’  ■ 

Example  7.1 

In  this  example  we  seek  to  estimate  the  initial  data 
(n , )  €  R  x  L2(^l,0)  and  the  coefficient  of  the  delayed  term  in  a 
simple  scalar  equation.  The  system  is  described  by  the  equation 

x(t)  =  .05  x(t)  +  a^ft-l)  +  u  ^  C  t ) , 

with  (unknown)  initial  data 

x(0)  =  n,  xQ(s)  =  ♦  C®)*  -1  <  s  <  0, 

and  output 

y (t)  =  x ( t ) . 


in  i- 


61 


Data  was  generated  as  described  previously  using  the  true  values 
n*  =  1 ,  <{> *  =  1,  aj  =  -4.0.  For  each  N  =  2,  4,  8,  16  and  32,  the 
approximating  problem  (i^)  was  formulated  as  discussed  in  section 
4.  Thus,  for  AVE  we  seek  the  "parameter" 


-N  _  ,  .N  .N  .N  v 

-  In  »<f>2 »  •••»  *N  ’  1  ^  * 

N  N  N 

where  Cn,4>2»<^2*  •••»  are  coordinates  of  the  AVE  projection  of 

the  initial  data.  Similarly,  for  SPLINE  we  seek  the  "parameter" 


-N 


=  rcN 


cjj,al) , 


N  N  N 

where  (£q,5j,  ....  SN)  are  coordinates  for  the  SPLINE  projection 

of  the  initial  data.  The  "start-up"  for  (n,4>)  C  R  *  L2(-1,0) 

was  taken  as  the  zero  initial  data  (0,0),  whereas  the  "start-up" 

N  0 

for  a^  was  chosen  as  a^’  =  -3.0.  Table  7.1.1  provides  an  overview 

of  the  numerical  findings.  Because  the  initial  data  is  in 
R  x  L2(-1,0)  we  have  only  displayed  the  Z-norm  of  the  error  and 
the  estimated  value  for  a^.  The  comparison  of  the  two  schemes  is 
quite  striking;  in  particular,  note  the  relative  accuracies  in 
estimating  the  initial  data.  Compared  in  Figure  7.1.1  are  graphs 
of  the  true  initial  data  and  the  corresponding  estimates  produced 
by  AVE  and  SPLINE  for  N  ■  4.  It  is  apparent  that  (at  least  for 
the  chosen  "start-up"  values)  the  SPLINE  procedure  readily  finds 
good  estimates  for  the  parameters,  while  the  AVE  scheme  has  con¬ 
siderable  difficulty. 
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It  is  of  some  interest  to  compare  the  sequence  of  data  fits 
generated  as  the  MLE  iteration  procedure  evolves.  Figures  7.1.2, 
7.1.3  and  7.1.4  show  the  data  matches  from  the  AVE  algorithm  (with 
N  =  8)  for  MLE  iterations  0,  4  and  9,  respectively.  From  the 
match  at  iteration  4  (Figure  7.1.3)  it  might  be  deduced  that  the 
AVE  scheme  is  in  trouble.  However,  at  iteration  9  the  fit  is  quite 
good  and  Figure  7.1.4  does  not  give  any  hint  of  the  poor  values  of 
the  parameters  indicated  in  Table  7.1.1. 

Figures  7.1.5,  7.1.6  and  7.1.7  illustrate  the  SPLINE  matches 
at  iterations  0,  4  and  9,  respectively.  Again  the  iteration  4 
matches  indicate  some  difficulty  while  by  iteration  9  the  match  is 
quite  good.  It  happens  that  the  SPLINE  estimates  of  the  parameters 
are  excellent. 

Although  one  cannot  be  certain,  for  the  AVE  scheme  it  does 

appear  that  the  MLE  procedure  is  converging  to  a  local  minimum  of 
N 

J  .  We  suspect,  however,  that  the  problem  (i^N)  suffers  a  lack  of 
identif iability .  (See  [8]  for  a  further  discussion  of  this 

matter.)  The  problem  (J^)  for  SPLINE  seems  to  be  much  better 
behaved. 

In  order  to  investigate  further  identif iabi 1 ity  for  problems 
with  unknown  initial  data,  we  made  further  computations  for  this 
example  using  the  same  dynamics,  changing  only  the  initial  data  to 

n  =  1,  <|>(s)  *  1  +  s,  -1  <  s  <  0. 


Using  the  same  start-ups  as  above,  we  found  that  SPLINE  converged 
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for  all  N  values,  whereas  AVE  never  did.  Results 
in  Table  7,1.2. 


are  summarized 
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AVE 

SPLINE 

N 

iN 

al 

1  z* (0) - zN (0)  | 

N 

-N 

al 

| z* (0) -zN  (0)  | 

2 

-4.4103 

2.08 

2 

-4.4382 

.1595 

4 

-4.9924 

4.53 

4 

-3,9381 

.0867 

8 

-4.2651 

41.76 

8 

-4.0031 

.0287 

16 

did  not 

converge 

16 

-4.0031 

.  0201 

32 

did  not 

converge 

32 

-4.0001 

.0386 

TABLE  7.1.1 


_ _ AVE _ SPLINE _ 

N  |z*(0)-zN(0) |  N  aj  | z * (0) - zN (0) | 


2 

2 

-4.5201 

.  0563 

4 

did  not  converge 

4 

-4.0975 

.0318 

8 

8 

-4.0282 

.0123 

16 

16 

-4.0123 

.0193 

32 

32 

-4.0122 

.0936 

TABLE  7.1.2 
(linear  initial  data) 


T 


T 


AVE  N  =  8 
MLE  ITR  =  0 


66 


x(t)  *  ,05x(t)  +  a.x(t-l)  +  u  (t) 

•  •  X 

x(0)  =  7]  ,  XQ(s)  a  Cp(s) 


data 

a*  =  -4.0 

AVE  Model 

8,0 

=  -3.0 

|z*(0)  -  28,°(0)|  =  fl 

SPLINE  N 
MLE  ITR 


.  x(t)  -  .05x(t)  +  ajX(t-l)  +  u 
x(0)  »  Tj  ,  xQ(s)  =  cp(s) 

+  +  +  data 
-  SPLINE  Model 


TIME 

FIGURE  7.1.6 


2 


Example  7.2 

We  consider  an  equation  with  a  continuous  (a  constant  function) 
kernel  in  which  we  wish  to  estimate  the  kernel,  a  system  coeffic¬ 
ient,  and  the  time  delay.  The  model  is  assumed  to  be  of  the  form 


*(t)  =  a^xft-r)  +  k 


x(t+s)ds  +  u  (t) , 


with  initial  data 


xQ(s)  HI,  -r  <  s  <  0, 


and  output 


y(t)  =  x ( t) . 


The  true  parameters  aj  =  -3.0,  k*  »  -1.0  and  r*  =  1.0  were  esti¬ 
mated  using  start-ups  of 


-3.5, 


rN,0 


1.5. 


Runs  were  made  for  N  *  2,  4,  8  and  16.  The  MLE  algorithm  for  the 
AVE  scheme  did  not  converge  for  N  =  2  and  4.  However,  for  N  =  8 
and  16  the  AVE  scheme  converged  but  produced  rather  poor  parameter 
estimates.  The  SPLINE  scheme  converged  for  each  N  =  2,  4,  8,  16 
and  for  N  >_  4  produced  good  parameter  estimates.  The  numerical 
results  for  this  problem  are  summarized  in  Tables  7.2.1  and  7.2.2, 
where  eM  =  y^-y*  is  the  error. 
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Figures  7.2.1  through  7.2.4  compare  the  N  =  8  AVE  and  SPLINE 
data  fits.  In  particular.  Figures  7.2.1  and  7.2.2  show  the  N  =  8 
AVE  start-up  and  converged  data  fits,  respectively.  Figures  7.2.3 
and  7.2.4  show  similar  results  for  the  SPLINE  procedure. 


AVE  | 

N 

-N 

al 

1  eN  * 

2 

did  not 

converge 

4 

did  not 

converge 

8 

.8802 

.2182 

-4.1641 

2.0657 

16 

.9383 

-  .3806 

-3.5535 

1.2346 

Y*  = 

1.0000 

-1.0000 

-3.0000 

Table  7. 

2.1 

SPLINE  | 

N 

rN 

kN 

-N 

al 

leNl 

2 

.9100 

-  .4376 

-3.4478 

1.1002 

4 

.9896 

-1.0087 

-3.0580 

.0071 

8 

1.0018 

-1.0390 

-2.9953 

.0455 

16 

1.0042 

-1.0410 

-2.9841 

.0611 

Y*  = 

1.0000 

-1.0000 

-3.0000 

Table  7.2.2 


AVE  N 

*<t) 

*  ajX(t-r)  +  k  J0  x(t+s) 
-r 

ds  +  u  ^(t) 

MLE  ITR 

+  +  + 

data  a*  =  -3.0 

r*  =  1.0 

k*  = 

AVE  Model  a?’°  =  -3.5 

8,0  ,  c 

r  ’  =  1.5 

k8,( 

FIGURE  7.2.1 
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2.  00 


1.20 


0.H0 


-0.41 


-1.20 


-2.00  1 - 


0.00 


0.40 


0.80 


1.20  1.60  2. 


TIME 

FIGURE  7.2.2 


+  +  +  data  a*  ■  -3.0  r*  =  1.0  k*  = 

-  SPLINE  Model  a®’0  »  -3.5  r8,°  =  1.5  k8,0 


SPLINE  N  =  8 
MLE  ITR  =  6 


x(t)  »  ajx(t-r)  +  k 


+  +  +  data  a* 

-  SPLINE  Model  a\ 


f*  x(t+s)  ds  +  u  ^(t) 
-r 


-  -3.0  r*  =1.0  k*  =  -1. 

6  =  -2.9953  r8,6  =  1.0018  k8,6=-l 


Example  7.3 


In  our  final  example  we  consider  an  oscillator  with  retarded 
damping  and  retarded  restoring  forces.  We  seek  to  estimate  the 
coefficients  of  the  delayed  terms  and  the  time  delay  itself.  The 
system  is  governed  by  the  equation 


x(t)  +  16x(t)  +  agk(t-r)  +  a^x(t-r)  =  u  ^ ( t ) 


with  initial  data 


xn(s)  2  1,  *n(s)  20,  -r  <  s  <  0, 


and  output 


y(t)  =  x(t) 


i  This  second-order  equation  is  equivalent  to  the  two-dimensional 


system 


XjCt)  0  1  Xj 

x2(t)  16  0.  x2 


with  initial  condition 


"xll  T 1 

1  (s)  2 

x,  0 

L  tio 


o  0  x,(t-r) 


1  ‘afll  Lx2Ct'r2 


u  i (t) , 


-r  <  s  <  0 , 
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and  output 


y(t) 


[1  0] 


'^(t) 

x2(t) 


The  true  parameters  to  be  estimated  are  =  10.0,  a£  =  -10.0 
and  r*  =  1.0.  Start-up  values  for  each  run  were 


a 


N ,  0 
0 


11.0, 


-9.0, 


N ,  0 
r  ’ 


1.2. 


Convergence  results  for  this  example  are  summarized  in  Tables  7.3.1 
and  7.3.2.  At  N  =  16  the  relative  error  (|e^|/|y*|)  for  AVE  is 

approximately  3.5%,  while  the  N  =  16  SPLINE  scheme  produced  a 
relative  error  of  less  than  1%. 

Figures  7.3.1  and  7.3.2  show  the  N  =  4  converged  data  fits  for 
AVE  and  SPLINE,  respectively.  For  N  >  8,  the  data  fits  are  nearly 
perfect  and  are  not  shown. 


AVE 

N 

ao 

-N  -N 

al 

laNl 

2 

did  not  converge 

4 

54 . 5124 

-9.1876  2.4190 

46.7439 

8 

19.4941 

-9.4927  1.3506 

10.3520 

16 

10.6433 

-9.9089  .9998 

.7346 

y*  = 

10.0000 

-10.0000  1.0000 

Table  7.3.1 


SPLINE 

N 

iN 

ao 

aN 

-N 

I»NI 

2 

9.2585 

-10.5360 

1.0908 

1.3683 

4 

10.0927 

-10.0619 

1.0076 

.1622 

8 

9.9724 

-10.0177 

1.0010 

.0463 

16 

9.9811 

-10.0108 

1.0017 

.0314 

y*  = 

10.0000 

-10.0000 

1.0000 

Table  7.3.2 


3.  00 


r- 


j 


> 

! 


4 


E 


J 


2.40 


1 . 80 


1.20 


0.60 
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